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Abstract 

In this paper we study stochastic process indexed by 1 constructed from certain transition 
kernels depending on the whole past. These kernels prescribe that, at any time, the current 
state is selected by looking only at a previous random instant. We characterize uniqueness 
in terms of simple concepts concerning families of stochastic matrices, generalizing the results 
previously obtained in De Santis and Piccioni (J. Stat. Phys., 150(6):1017-1029, 2013). 
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1 Introduction and main definitions 


This paper is concerned with stochastic processes indexed by Z taking values in a finite alphabet G, 
constructed from a transition kernel which depends on the whole past, i.e. a map p : G x 
[0,1] such that, for any choice of wzfi = {w-i,W- 2 , ■ ■ ■) € p(-|wzfi) is a probability 

measure on G. In the literature these models appear under various names, as chains with complete 
connections [H], g-functions [3] or processes with long memory [4j. 

Two alternative ways to associate a G-valued stochastic process, i.e. a probability measure on 
G^, to a transition kernel are possible. The first deals with processes with a boundary condition. 
Let w = {wn,n € Z) be an arbitrary configuration, i.e. an element of G^, possibly random, and 
let r € Z; we say that = {Xn'^,n E Z) is governed by the kernel p with boundary condition 
w from the instant r, if = Wn for n < r and 




n—2’ 


.)=p(5|A;’yi,<’r2,...) a.s., 


( 1 ) 


for any n > r. It is clear that, given the law of w (in particular if it is a deterministic sequence), 
the law of this process is uniquely defined. If for some strictly decreasing sequence (r^), 
converges weakly in the product topology of G^, the limit is said to be an infinite volume limit. By 
weak compactness of the set of probability measures on a compact space, this set is non-empty: it 
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reduces to a single element if and only if converges weakly to fj,, as r ^ —oo, irrespectively 
of w. Infinite volume limits are mainly considered in the theory of multi-dimensional random fields 
dnuT]. Here we find convenient to borrow the standard usage in the multi-dimensional framework 
to introduce boundary conditions from an entire configuration on Z, rather than shifting to the 
left a configuration defined only on the half-line —N+. 

The second construction, more often used in the one-dimensional time-directional context we 
are concerned with, is to declare directly a process X = {Xn,n G Z), equivalently its law, to 
be compatible with p if ([1]) holds for any n G Z. This is analogous to the Dobrushin-Lanford- 
Ruelle definition in the theory of multi-dimensional random fields [IIKIT]. Compatible laws are 
immediately seen to be infinite volume limits; indeed if W is compatible with p and we choose 
it as a boundary condition, then X^’^ has the same law of W, for any r G Z: here we profit of 
having allowed random boundary conditions. Conversely, since © is equivalent to 








for any positive integer m and any real function h defined on G™, this relation is maintained in 
the limit provided p{g\-) is continuous for any g G G. In this paper only continuous kernels will be 
considered, therefore we will identify infinite volume limits with compatible laws, denoting their 
set with G{p). In the proofs both characterizations of G{p) will be found useful. 

Notice that elements of G{p) are not necessarily stationary, i.e. translation invariant, but from 
a non-stationary element of G{p) one can produce a stationary one by performing Cesaro averages 
of shifts over a finite window increasing to Z. Thus if G{p) reduces to a single element, it has to be 
stationary. On the contrary, it is possible that |f?(p)| > 1 but this set contains only one stationary 
element; indeed we will present later a situation in which this happens. Notice that, G{p) being 
convex, in case of non uniqueness G{p) has infinitely many elements. 

Uniqueness conditions for general transition kernels of the form ([T]) are scattered in the lit¬ 
erature for various decades. Some of these results refer to a dynamical systems setting, see e.g. 
[201 [26l dH] . The use of techniques of a more probabilistic flavor, in particular coupling techniques, 
has increased in time, see e.g. munns]. The work |3] has started a constructive approach, focused 
to the design of perfect simulation schemes for the unique compatible measure. In a number of 
cases this has allowed to prove not only the uniqueness, but also the existence of a compatible 
law, when G is countable. Finally, multi-dimensional statistical mechanics techniques, such as the 
Dobrushin criterion, have recently been used also in this setting mm- For perfect simulation in 
the multi-dimensional case the reader is addressed to e.g. laiTlEl]; also the continuity assumption 
can be relaxed, as in |6]. 

The various sufficient conditions for uniqueness usually take a suitable positivity condition 
together with some regularity assumption on the kernel p. The latter allows to control the behavior 
of the range of the functions p((7|wZ,':-) on for fixed g G G and wZ^ G as r gets 

large. Regularity assumptions of some sort are actually needed for uniqueness, as shown in |3|, 
where an example of a positive transition kernel has been given with a strong “dependence on the 
remote past” that gives rise to different infinite volume limits. 

In order to motivate the class of kernels considered in the paper it is useful to recall the setting 
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of [1]. In this paper they write down a decomposition of a continuous kernel of the following form: 

OO 

^(S'lwl^) = Oou{g) + '^ekP(k){g;w-i,....w-k), (2) 

k=l 

where v \s a probability distribution on G, 0 = {9n,n € N) is a probability distribution on the 
integers and for any k G N+ : G x —>■ [0,1] is a transition kernel depending only on the A:-th 
most recent values. If 0o > 0 and 9n decays to zero fast enough they provide a perfect simulation 
algorithm for the unique compatible measure. The first assumption corresponds to positivity of 
p{g\-), for some g £ G, whereas the second amounts again to a regularity assumption on the kernel 

p. 

The mixture decomposition presented in [3] is not unique. Other decompositions have been 
proposed to prove uniqueness laiiain], leading to relax not only the regularity but also the 
positivity assumption in [3]. 

In the present paper we consider general transition kernels of the following form 

^kP{k){'W-k,g), (3) 

kGA 

for some probability distribution 9 supported by ^ C N+ and P(^k) is a stochastic matrix on G, 
for any k £ A. Since YlkeA^k ~ kernel of the form ([3|) is clearly continuous. A transition 

kernel of the above form will be called an imitation kernel. 

When the P(fc)’s have the property that each row contains only a single positive entry, neces¬ 
sarily equal to 1 (as happens in particular for permutation matrices), the updating rule ([T]) means 
= fk{Xl^'^^) with probability 9k, where fk is a function on G obtained from P(k)- We refer 
to these cases as imitation kernels without noise. Otherwise we speak about imitation kernels with 
noise. Imitation kernels without noise are in some sense, to be clarified later, the most interesting 
to consider. 

For general kernels of the form ([3]) the value can be drawn in the following way. An integer 
Kr+i is chosen at random according to the distribution 9, and the value of the boundary condition 
Wr+i-Kr+i is read. Then is drawn from the Wr+i-Kr+i^^ row P(^Xr+i){'>JJr+i-Kr+i, ■) of the 

matrix P(Kr+i)- To perform this step, it is convenient to make reference to a sequence (/(fc), k £ N+) 
of coupling functions f(^k) : G x [0,1] —>■ G, having the property that, for any k £ N+, f(k){9-, U) is 
distributed as P[k){9^') whenever G is a random variable uniformly distributed in [0,1], for g £ G. 
So, if Ur+i is uniformly distributed in [0,1], Ur+i) yields This updating 

rule can be iterated to produce the values of the process at all sites n > r by drawing a 

random sample {Kn,n > r) from 9 and an independent random sample {Un,n > r) from the 
uniform distribution on [0,1]. It is clear that for imitation kernels without noise, the UAs are not 
needed for the construction of 

Rather than proceeding forward from the boundary sites, one can proceed backwards from any 
site of interest. In this case to produce the random variable Xfi'^, with n > r, we have to follow 
the random walk , /c G N) 

= for feGN, (4) 

k 
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(n) 

with Tg ' = n, whose distribution of decrements is 6. 

Let us define 

M(-) = mi{k : < r}, F^ (5) 

which are the minimum number of steps leading the random walk to land on a site below the 
threshold r and the landing site, respectively. The information on the boundary condition w is 
propagated forward by applying recursively the coupling functions in the following way 


-'fe-l 






( 6 ) 


starting from . Thus, at the end of the recursion one has, for any n> r 

f(") L- 


— h^r,n{KmtUm: ^ <^171^ Tl, Wy(n)) 


(7) 


for some suitably defined function Fr^n- 

One can appreciate here that, if an additional zero order term 9Qi7{g), with 0o > 0 and ly 
probability measure on G, appears in the kernel ([3]), the iL„’s can also assume the value 0 with 
probability 9q. When this happens, one stops the random walk from going further in the past 
and reads directly the value at that site by sampling from ly. Since this event will happen a.s., 
uniqueness always holds in this case. Incidentally, 9o > 0 means < 1, the Dobrushin 

sufficient criterion for uniqueness for this kind of kernels. It is not difficult to realize that a zero 
order term cannot be singled out when each of the columns of has a zero entry, for all k & A. 
In particular, this happens for imitation kernels without noise, except in the trivial case of some 
with all the rows equal to the same unit versor. 

Whenever for some pair of distinct sites m,n G Z, it happens that = T^'^\ for some 

positive integers h and k, we say that the two random walks started from the sites m and n 
coalesce. If this is the rightmost site in which this happens, we say that is the 

coalescence point of the two random walks. In this case one has = 7^+/) for any I € N, hence 
for r < , it is . As a consequence the values Wy(m) and Wy(n) coincide, conveying 

all the information about the boundary condition w needed to compute both and Xn^, by 
means of the functions and Fr^n defined in (j7|l. 

If the random walks started from n G A, with A arbitrary finite subset of Z, coalesce a.s. 
we say that the distribution 9 = {9n,n G A) is coalescent. In order to verify this property it is 
enough to check it for a window A made by two adjacent sites of Z. If we let two particles perform 
two independent random walks started from these two sites, with the rule that it is always the 
rightmost that moves, the distance between the two particles is a Markov chain on the integers, 
the so called von Schelling process m, up to coalescence. This is again a random walk with 
decrements following the law 9, but with a reflection around the origin once the negative half-line 
is hit. Thus coalescence of 9 means that from any n G N+ the return of this process to the origin 
is almost sure; this requires both some algebraic property for A and a control of the tail behavior 
of the 9As, see [23]. 
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Back to imitation kernels, we mention that already in [3] a particular class of binary kernels of 
the form ([3j) was examined, in which took only the two possible values 


h 


1 0 
0 1 


) J2 



( 8 ) 


for k ^ A, the so-called binary autoregressive kernels. However the presence of a zero order term 
0Qu{g) made the uniqueness problem trivial. In our previous work [9] we have considered binary 
autoregressive kernels with 8q = 0, equivalently with the H(fc)’s equal to either I 2 or J 2 , making 
a first step towards understanding the implications of the lack of positivity for imitation kernels. 
The main result of that paper is that for a coalescent 9 uniqueness holds. 

In the present paper the results are completely general, and not restricted to the binary case. 
The main result is that uniqueness for imitation kernels can be characterized completely in terms 
of the properties of what we call the G-stoehastic function induced by the imitation kernel ([3|) , 
namely the mapping 

k ^ A d N-i- I—^ ^{k)‘ 

The necessary and sufficient conditions generalize the well known concepts of irreducibility and 
aperiodieity for a single stochastic matrix. 

Irreducibility is discussed in Section [2l Since the presence of two irreducible classes implies the 
existence of two different compatible laws (Proposition[2]) and states not belonging to an irreducible 
class cannot appear in the support of a compatible law (Proposition [3]), we are allowed to focus 
our further study to irreducible kernels. 

Aperiodieity is the subject of Section [3l Here a difference with the case of a single stochastic 
matrix appears: the states are constrained to have a period which is a multiple of the ged d{A) of 
A. But since any element of G{p) has independent marginals along the residual classes mod d{A), 
the uniqueness problem is reduced to any of them, for which with an obvious rescaling d{A) = 1. 
This is the content of Proposition^ which allows to correct a mistake occurred in [9]. Furthermore, 
as it happens for finite Markov chains, the presence of several periodic classes implies the existence 
of different non-stationary elements of G{p), obtained one from the other by shifts (Theorem [1]) . 
Notice that this kind of phase transition is entirely different from the one in [3] that concerns a 
positive kernel. At the end of the section we prove an important lemma relating the stationary 
elements of G{p) with the invariant distributions of the stochastic matrix P = 

In Section [3] we prove our main result (Theorem [2]), which is analogous to the ergodicity 
theorem for finite Markov chains: uniqueness holds for irreducible and aperiodic imitation kernels. 
The unique invariant distribution A of H is identified as the single-site marginal of the unique 
compatible law. 

The fact that the tail behavior of the 9k s does not enter in the uniqueness conditions entails 
that, by keeping A fixed, but distributing enough mass to larger values of k € A, we can construct 
examples of uniqueness in which any of the general sufficient conditions appeared in the literature 
fails. The uniqueness theorem appearing in [9] is found as a particular case, without assuming 
coalescence (Theorem [3l). 

In Section [5] we propose two simulation algorithms of the CFTP type [23], to construct the 
unique compatible law on any finite window of Z. The first, presented in Theorem [U works when 
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the distribution 0 is known to be coalescent. When 6 is not coalescent or at least this is unknown, 
a threshold has to be specified, introducing an error in the algorithm. In Theorem [7] we prove that 
the error introduced in this way can be made arbitrarily small pushing the threshold towards — oo. 
For this reason we call it an e-perfect simulation algorithm. In a situation of non uniqueness, the 
algorithms presented here can still be used to construct any stationary element of Q{p). In the 
irreducible but non-aperiodic case, it can also be proved that there is only one stationary element. 

Finally, in Section [6] we present a result for the case of countable G. A sufficient condition for 
the existence of a unique element in Q{p), together with a perfect simulation algorithm is obtained. 
For finite G, such a condition reduces to irreducibility and aperiodicity. The algorithm eliminates 
the approximation error for non-coalescent 0 but it can be considerably more complicate for G 
large or infinite. 


2 Irreducibility of G-stochastic functions and uniqueness 

We start with a brief discussion of mappings defined on some A C N+ with values in the set of 
stochastic matrices over the set G. We call a mapping of this type a G-stochastic function. 

Recall that the free semigroup generated by A is the set A* = UneN+ finite n-tuples with 

elements in A, for all positive integers n, called words in the sequel. It is indeed a semigroup under 
concatenation, defined for a = (ui,..., an) and b = (6i,..., bm) by ab = (oi,..., Un, bi,..., bm). 
A G-stochastic function defined on A extends to a homomorphism of the semigroup A* into the 
semigroup of stochastic matrices on G by associating to each a = (ai,..., an) the stochastic matrix 

P8L = P{an)---P{ai)- (9) 

Likewise, for any a = {ai,... ,an) G A*, we define the composition of coupling functions 
/a : G X [0,1]" ^ G as 

/a (s') ■ 7 ^n) f{a.i) (/(a 2 ) (• • • /(a„_i) if (an) idi ^n)i '^n—l)) • • • ) ^ 2 ): '^^l) 7 

for gr G G and Uj G [0,1], f = 1,..., n. We define the depth of a as s(a) = 

Let us define a directed graph F^ with the sites of Z as vertices, and arcs joining n G Z with 
n — k, whenever k a A. It is natural to visualize the elements a G A* as paths of the graph Fg). 
Once we have weighted the arc (n, n — k) with the probability 9^ > 0, we can assign a probability 
to any path, given by the product of the probabilities of the arcs belonging to the path. Now, 
for a = (oi,..., am) G Pa is the stochastic matrix used to compute the value of Xn'^ from 
the value for n - s(a) > r, whenever Kn = ai,Kn-ai = a 2 ,..., Kn-ai-...-am-i = «m: an 

event which has probability 0a = ^ai ■ • • • ■ 0a^ ■ Notice that infinitely many paths are associated 
to each a G A*, differing in the starting site in Z. Also observe that the depth s(a) is the distance 
of the last site of the path from the first one (see Fig. I). 

The sample K = [Kn, n G Z) selects a particular random subgraph F^ of F^i, made of the arcs 
(m, m — Km), for all m G Z. Likewise, the random walks 'TA) = {Tj^\ k G N), for n G Z, can be 
actually seen as random walks on the graph Fg. 
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• ••••••••• 


Fig. 1. The word a = (2,4, 3) and the corresponding path. 

We can extend to a G-stochastic function a number of concepts which are well known 
for the “standard” case of a single stochastic matrix P, i.e. the G-stochastic function defined on 
A = {!}, with = P. 

Definition 1. Let P(.) be a G-stochastic function defined on A and i ^ j & G. We say that i ^ G 
P(^.ycommunicates with j £ G if there exists a. £ A* such that Pa{i,j) > 0- say that i and j 
P(^.yintercommunicate when i communicates with j and vice versa. 

As usual for the standard case, we declare that each i £ G intercommunicates with itself, so that 
intercommunication becomes an equivalence relation and G is partitioned in intercommunicating 
classes. 

Definition 2. An intercommunicating class C is closed when i £ G communicates with j £ G 
implies j £ G (therefore j communicates with i). We say that a G-stochastic function Py is 
irreducible if for any i and j £ G there exists a G such that Pa{i,j) > 0? is the whole G is 
the only intercommunicating class. More generally, Py is essentially irreducible if there exists a 
single intercommunicating class. 

It is easily verified that Py is irreducible if and only if P is irreducible. As in the standard 
case it is possible to decompose G in a rather familiar way. 

Proposition 1. Let Py be a G-stochastic function. There exists a unique partition {Pi,..., Rs,T} 
of G, with s > 1, where 

1) Rh is a closed intercommunicating class, thus {P(k) ^ *-5 irreducible, for h = 1,..., s; 

2) for any i £T there exists j £ RiLi .. .U Rg and a G such that Pa{i,j) > 0. 

Proof. The proof is completely analogous to that for a single finite stochastic matrix. In that case 
the sets Rh, h = 1,..., s represent the recurrent states, whereas the set T, which is the union of 
the intercommunicating classes that are not closed, represents the remaining transient states. □ 

The reader will notice that the above decomposition coincides with that concerning any convex 
combination of the stochastic matrices Py)A ^ with positive weights, in particular the matrix 
P. The following result allows to rule out a trivial case of non uniqueness for kernels of the form 

©. 


Proposition 2. If the G-stochastic function Py has more than one closed intercommunicating 
class then for the kernel dsn one has \Q{p)\ > 1. 
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Proof. The restriction of a kernel ([3|) to a closed intercommunicating class Rh is by itself a transition 
kernel pR^ on Rh- Identifying Q{pRyf) with a subset of Q{p) in the natural way, and taking into 
account that GipR^f) are non empty and disjoint, for h = 1,..., s, the statement of the theorem is 
immediately obtained. □ 

The simplest example of this sort is contained in [9]: if = I 2 for all k € A, then the 
two states are two closed classes and the Dirac measures on the two constant sequences are two 
elements of G{p). 

The following proposition ensures that, when there is only one communicating class, we can 
restrict the kernel to it. 

Proposition 3. For a transition kernel of the form ([3j), consider the corresponding G-stochastic 
function P(.) and suppose that R is the union of all closed intercommunicating classes. Let pR be 
the restriction of p to R. Then G{p) = G{pr) for the kernel l|3|l. 

Proof. The main step is to prove that if X ~ /r € ^(p), then, for any n € Z, P(X„ € R^) = 0. For 
this it is enough to prove that 

lim P{Xlf’^ G P") = 0 (10) 

r—^—oo 

for any w G G’^. 

Starting with Zq = R^, we define recursively a sequence of subsets C P^, for /i = 1,..., with 
strictly decreasing cardinality, until for some integer L it \s Zr = tb. During this construction we 
will define slr G A* of length hr and Borel sets A/j C [0,1]”'* of positive Lebesgue measure Lehn^, 
for /i = 0,l,...,L — 1. The recursive construction is given by 

-2/1+1 = {/ah(b A/i),i G Zh} nP^ 


and has the properties 

1. {/a^(i, A/i),i G Zh} nR^ib, h = 0,...,L-l 

2. For any i G Zr, \U^{i,Ah)\ = 1. 

Let us explain the generic step h of the construction. Fix an arbitrary state j G Zh. By the 
definition of P there exists a word a/^ G A* of length Uh such that Pa^ (j, P) > 0. This ensures 
the existence of A^ with a positive n/i-dimensional Lebesgue measure such that /a^(j, A))) G P, 
hence Property 1 is guaranteed. To obtain Property 2 one may need to reduce A)) to some smaller 
Ah C A^ keeping positive Lebesgue measure, which is clearly always possible by finiteness of Zr. 

Now we are in a position to prove (|10ll . Define the word a of length m = no + ui + ... + ur-i 
by the concatenation a = a+_i...ao and the Borel set A = A+_i x ••• x Aq C [0,1]™. Let 
c = 0a • Lebm(A) > 0. 

Recall the recursive construction of Xn'^ in terms of the random walk T’A) = G N} 

with the corresponding sequences {Kh = K (n),k G N}, obtained through the relation (j3|), and 

let {Uk = U („),k G N}. If, for some integer I, a segment (PT/,..., P/+m, D/,..., P/+m) belongs 

J u 
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to a X A with > r, then Xn'^ G R, irrespectively of w. Since a segment of this kind will 

eventually occur with probability 1, (fT0]l holds. 

As a consequence, if /U G Q{p) then p{R^) = 1. Moreover, being /r compatible and supported 
by R^, it is actually in G{pr). □ 


3 Periodicity of G-stochastic functions and uniqueness 

In the previous section we have justified to restrict our attention to irreducible G-stochastic func¬ 
tions In this section we turn our attention to the notion of periodicity of a state i € G, which 
is slightly more delicate than in the standard case. We start by observing that the depth s is a 
homomorphism of the free semigroup A* into the additive semigroup of positive integers N_|_. As 
a consequence s(M*) is a sub-semigroup of N+. The period of A is defined as 

d{A) = gcd{s{A)} = gcd{s{A*)}. (11) 

If 1 G M, as it happens in the standard case, then s(M*) coincides with N+, and d{A) = 1. Recall 
that, except for a finite number of elements, an additive semigroup of positive integers has always 
the form {nod, (no -|- l)d, • • • }, where d is the gcd of the semigroup and no is a suitable positive 
integer. 

Now suppose that the transition kernel p in ([3]) has d{A) > 1. Then it is natural to move from 
p to 

P(5|wl;^) = '^Oid(_A)Pgd{A))i'W-i, g), (12) 

l&A 

with A = {I : ld{A) G A}, so that d{A) = 1. The following proposition allows us to set d{A) = 1 
in all the uniqueness proofs of the present paper, without restriction of generality. 

Proposition 4. For a transition kernel p of the form ([3]) let d{A) > 1 and define p as in (fT^ . 
Then uniqueness holds for p if and only if it holds for p. 

Proof. Let us consider the process governed by the transition kernel p. Then for any h = 

0,1,..., d{A) — 1 the d(M)-marginal process defined by 

t e z (13) 

is governed by the kernel p, with boundary conditions = (rch-i-fc(i(^) > ^ £ ^)) with r^^'^ = 

maxjfe : h + kd{A) < r}. If uniqueness holds for p, then converges weakly as r —oo to 

the unique element g of G{p), for any choice of ■w G G^. Let Y be a process with distribution 
g. Likewise the process X(/j) converges weakly to = (Yh-i-A:d(^)i ^ ^ ^)) as f —oo which 
proves that uniqueness holds also for p. 

For the converse notice that, for any r, conditionally to w G G^, the d(M)-marginal processes 
X(;j), defined in (fT^ . are independent, for h = 0,1,... , d(M) — 1. By consequence, if g is the 
unique element in G{p), each of the d(M)-marginal processes converges to it, so the whole process 
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has a limit distribution with the (i(yl)-marginal processes ^ distributed and independent, 
which ends the proof. 

□ 

The previous proposition corrects the erroneous statement contained in our paper [9] (see 
Proposition 1 and Theorem 1) that d{A) = 1 is necessary for uniqueness. 

Next assume that d{A) = 1 and define the period of z G G to be dj = gcd{s{A*)), where 
A* = {sl ^ A* Pa{i,i) > 0}. If dj = 1 we say that the state i £ G is aperiodic for P(.), The 
following proposition guarantees that, for irreducible G-stochastic functions, we can refer the term 
to the whole function, since all states have the same period. 

Proposition 5. Let be an irreducible G-stochastic function defined on A, with d{A) = 1. 
Then di = d, for i £ G, for some d £ N+. Moreover, there exists a partition of G in sets 
{Gh, /i = 0,..., d — 1} such that 

Phihj) > 0,i £ Gh ^ j £ Gh+s{h), 


identifying with Gh- 

Proof. By irreducibility for any pair i,j £ G there exists ai,a 2 £ A* such that P^fii, j)Pa 2 {jfi) > 
0. This implies that Pa 2 aii'lfi) > Pai{i, j)Pa 2 ijfi) > 0; so s(a 2 ai) = s(ai) + s(a 2 ) = kdi- Moreover 
if Ph{i,i) > 0, then PaibaaCjG') > Pa 2 {jfi)Ph{ifi)PaAh j) > 0, thus ai^*a 2 C A*, from which 

s(ai) + s(.T*) + s(a 2 ) = kdi + s(^*) C 

This implies that dj < di- Exchanging the roles between i and j one gets dj = dj as promised. 

For the second statement let us fix some reference state k £ G, and, for any d = 0, l,...,d — 1 
define the subsets of G 

Gh = {i £ G : Pa{k, i) > 0 for some a € with s(a) = nd + h, n £ N}. 

By irreducibility the union of the Gh’s is the whole G. Now suppose that i £ Ghi H Gh2- Then 
there exist ai £ A* with s(ai) = nid + hi and a 2 £ A* with s(a 2 ) = n 2 d + /12 such that 
Pai{k,i)Pa 2 {k,i) > 0. We can safely assume that rei = n 2 = n, since s{A’^) contains all the 
multiples of d large enough. Next let b G such that Pb(b k) > 0: we will have that 

^bai(fc,fc)Pba2(^,fc) > Ph{kkfPaAk,i)Pa,2{k,i) > 0, 

which implies that s(bai) and s(ba 2 ) are multiples of d. Hence 

s(bai) — s(ba 2 ) = s(ai) - s(a 2 ) = /ii - /12 

must be a multiple of d. Since |/i 2 — /ii| < d this happens only when hi = h 2 - So we have proved 
that {Gh, d = 0,1,... , d — 1} is a partition. 
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Next assume Pb(^,j) > 0, z € Gh,j € Gi. By assumption there exists a & A* such that 
Pa{k,i) > 0 with s(a) = nd + h. Then Pba(fc,j) > Pa{k,i)P\){iA) > 0, so 

s(ba) = s(b) + s(a) = s(b) + nd + h = md + I 

for some integer m, from which 

s(b) = (m — n)d +{l — h)^l = h + s(b), mod d, 


as desired. □ 

We can immediately make profit of the previous proposition to establish the following result. 

Theorem 1. Let p he a transition kernel of the form ([3]) and let d{A) = 1. If P(-) is irreducible 
but not aperiodic (i.e. d> 1) then \Q{p)\ > 1. 

Proof. By Proposition [5] the classes Go,Gi,... ,G^_.^ are well defined. Let us select an element 
from each class, say € Gj, for i = 0,..., d — 1 and define Wk = Qi k is congruent to i mod{d), 
for k & Z. Finally define w = (wk : /c € Z) and the translated w, with Wk = Wk+i, for A: € Z. 

Recall that any probability measure in G{p) can be obtained as a weak limit of the laws of 
for r f —oo. These laws give probability one to the event {Xq G Gq}, so this remains true 
in the limit. On the other hand the weak limits of the laws of give probability one to the 
event {Xq G Gi}, therefore the measure are necessarily distinct. This ends the proof. □ 

An example of application of Theorem [1] is the situation examined in [9] . If the image of A 
under is made by the matrices p and J 2 , then is irreducible. This happens also if this 
image is the singleton {J 2 }. Now suppose d{A) = 1 and, for A: G ^ and odd, = J 2 and for 
k ^ A and even, P{k) = h- Then, for any state i ^ G, s(gI*) is made only by even numbers, thus 
di is a multiple of 2. It is precisely 2 since d{A) = 1. Thus is not aperiodic. The two elements 
of G{p) constructed in Theorem [1] are Dirac measures supported by the two coherent sequences 
alternating the two states, as defined in [9]. Clearly these measures are not stationary: as a matter 
of fact the convex combination of them with equal weights is the unique stationary element in 
G (p) ■ More generally, under the hypotheses of Theorem [H we will prove in Section [5] that G (p) 
contains only one stationary measure. 

We close this section focusing our attention on the set I of invariant distributions for the 
stochastic matrix P = YlkGA^kP{k)- 

Lemma 1. The following statements hold: 

1. Let A G T and w be a configuration with X-distributed single-site marginals. Then, for any 
r G Z, X^’'^ has the same property. 

2. For any A G X there is at least one element of G{p) with single-site marginals equal to A. 
This element can he always chosen to be stationary. 
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3. A stationary element ofQ{p) has all its single-site marginals equal to an element of I. 

4 . If there is a unique (stationary) element in Q{p), there is a unique invariant measure for P. 

Proof. For 1. it is enough to notice that, by induction on n > r, and interchanging the two sums 
of positive terms, 

Pixf’^ = g) = = g I i<n)) = Y, Ok{\Pik)){g) = {\P){g) = Kg)- (14) 

keA 

For 2. take as above and send r to — 00 : any weak limit point will be in G{p) and it will keep 
the single-site marginals equal to A. By taking Cesaro averages on a window increasing to Z and 
going to the limit one gets at least one stationary process with single site marginals still equal to 
A. For 3., let X G ^(p) be stationary and call A its single-site marginals: then, similarly to (|14l) 

Kg) = PiXn =g)= E{PiXn =g\Xi,i<n)) = J2 ^k{\P(k)){g) = i\P){g). (15) 

k&A 

4. is an immediate consequence of 3. □ 

By the remark following Proposition [H it is immediately seen that X has a single element if and 
only if Pj-.) is essentially irreducible. So this is a necessary condition for the existence of a unique 
stationary element in G {p) ■ At the end of Section [5] we will able to prove that this condition is 
also sufficient. 

4 Main results 

After the results of the previous section, it remains to consider transition kernels of the form ([3j) 
with a corresponding G-stochastic function which is irreducible and aperiodic with d{A) = 1. 
In this section we prove that uniqueness holds for all of them. Here is the main result. 

Theorem 2. Letp have the form ([3]) and assume that is irreducible and aperiodie. Then G{p) 
has a unique element p, with single-site marginals equal to X, the unique invariant distribution of 

P. 


Before giving the proof of this result, we prove a weaker result in order to present the basic 
argument in a simpler context. Recall that under the above assumptions, P has a unique invariant 
distribution. 

Lemma 2. Let p have the form ([3]) and let be irreducible and aperiodie. Then, for any n G Z 
the distribution of Xn'^ converges to X, as r ^ — 00 , irrespectively o/w G . 

Proof. Without loss of generality, we take d{A) = 1 and n = 0. We are going to couple in a 
suitable way the random variables Xq'^ for any w G G^, in such a way that, as r — 00 , they all 
converge to the same random variable a.s. 
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Let io be a fixed state in G. By irreducibility the additive semigroup is non empty and, 

by aperiodicity, it has a gcd equal to 1: by the characterization of these semigroups, there exists 
mo G N+ such that, for m > mo, then m € Now use again irreducibility to prove that there 

exists no G N+ such that, for any i,j G G one can choose a word hij G A* with P^..{i,j) > 0 and 
s(bjj) = no- We call B the collection of these (distinct) words. These words can be identified with 
paths in Te with the same depth no- For any b G B we call 9^, its probability and let p = 6*b 

be the sum of the probabilities of these paths. 

Next we are going to construct the random walk = {T^^\ A: G N} by generating {K (o ), k G 

k 

N}, according to (jl|). Let be the interval between whose endpoints a path corresponding 

to a word in B appears for the h-th. time in the random walk (it is understood that such a word 
can vary with h). By definition u® — = no (see Fig. 2). The independence of the decrements 

of the random walk ensures that, with probability 1, a sub-walk in B will appear infinitely many 
times. Once reaches the site the value Xq'^ can be obtained from w m) by using the 
recursion in ([6]), with an exception for each of the sub-walks in B identified before. Suppose that 
one of these sub-walks joins s = with s — no = s — no > r. Then, conditionally to 

this event, the transition from to follows the transition matrix 

Q = iV0bFb. (16) 



• ••••••••••• 



Fig. 2. An example with B = {(1, 2), (2,1)}. Paths in B appear above the sites. 

This matrix is positive by construction, therefore there exists a coupling function /* : G x 
[0,1] —)■ G with the following property. If U is uniform in [0,1], /*(fl', U) has the law Q{g, •), and 
there exists e > 0 such that for u < e 

f*{ 9 ,u) = f*{h,u),\/g,h G G. (17) 

It is enough to number the states and use the Skorohod representation. Thus any transition from 
the left endpoint to the right one of the segments can be performed by drawing some 

independent random variable U with uniform distribution in [0,1]. When U < e occurs, then 
does not depend on and thus does not depend on w. Once this coupling occurs, by 

following the recursion ([6]) one obtains that Xq^ does not depend on w as well. 

Since each time a sub-walk in B occurs the U used by the coupling function /* are independent, 
coupling will happen with probability 1 and thus, as r —>■ — oo, Xg^ converges a.s. to a random 
variable which does not depend of w. Finally take w with all the single-site marginal equal to A. 
Then, by Lemma [H X^’'^ has law A for any r, and this law is kept by any weak limit point. This 
ends the proof. 

□ 
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Proof of Theorem [H Again, for any finite subset A C Z, we have to couple all the processes 
for all r € Z and w € in such a way that, as r —>■ —oo, they converge a.s. to the same limit 
vector, hence they share the same limit in law. 

Thus, we proceed first to the construction of the random walks k G N}, for n G A, 

by using the i.i.d. 0-distributed random variables {Km,m G Z). The intervals where a 

path in B occurs for the /i-th time, h = 1, 2,... are located within the random walk for any 

n G A, with the constraint that distinct intervals which overlap are discarded. 

We recall that the distance between two different random walks can be seen as a von Schelling 
process until the possible coalescence. The results in [2] imply that when d{A) = 1 it is recurrent if 
and only if the distribution 8 is coalescent. In this case any finite family of random walks coalesce 
a.s.; otherwise this process is transient and any pair of distinct random walks and pA) either 
coalesce or their distance goes to infinity a.s. 

The set A is partitioned according to the following equivalence relation: m = n whenever 
7 "("i) and coalesce. If this happens let Sm,n denote their coalescence point, otherwise we set 
Sm,n = —OO. For each of the equivalence classes Ci,..., G;, let 




ml{Sm,n,rn,n eCh), if |Gh| > 2, 
n, if Ch = {n}. 


(18) 


for h = 1,... , L For a coalescent distribution 0, / = 1 a.s. Otherwise the random walks do 

not coalesce, for h = 1,..., / and their distance goes to infinity with probability 1. 

Therefore there are infinitely many non overlapping intervals where a path in B occurs, for 
each of these random walks. With the same argument used in the previous proof we get that, 
with probability 1, provided r is sufficiently close to — oo, converge to A, irrespectively of w, 

independently for any h = 1,... ,1. Since any random walk P^"^\ for m G A, necessarily hits the 
set {5(01, • • •) ‘S'Ci}, by forward iteration of Q it is then possible to obtain the required limiting 
values for Xn'^, for all n G A. □ 


Next we apply the previous theorem to the class of binary autoregressive kernels examined in 
[^. Theorem 3 in this reference proved uniqueness under conditions (a) and (b) therein. Now 
condition (a) is nothing but irreducibility and aperiodicity of the associated G-stochastic function. 
Condition (b) assumes coalescence so, as a consequence of Theorem [2l this condition appears to 
be unnecessary for uniqueness. Altogether we have the following uniqueness theorem. 

Theorem 3. Consider a transition kernel p of the form ([3]) with |G| = 2 , d{A) = 1 and the range 
of P(^.^ contained in with p and J 2 defined in ([8]). Then uniqueness holds for p if and 

only if there exists m & A even with P(rn) = J 2 , or at least mi, m 2 G A, both odd, with P[mi) = J 2 
and 

Proof. It is clear that these conditions ensure that .p is contained in the range of (which is 
equivalent to irreducibility) and that they exclude that the odd elements of A are sent by into 
J 2 and the even into p (which is the only periodic case). □ 

By putting together all the results proved so far, we finally get the following 
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Theorem 4. For a given kernel p of the form ([3]) one has uniqueness if and only if the correspond¬ 
ing G-stochastic function P(.) is essentially irreducible and its single intercommunicating class is 
aperiodic. 

5 Perfect and e-perfect simulation 

In this section, under the conditions of Theorem [2] we construct simulation algorithms for the 
unique element p, of G{p) on a finite set of sites A C Z. Recall that we have assumed without loss 
of generality that d{A) = 1 , so the coalescence property of the distribution 9 is equivalent to the 
recurrence of the corresponding von Schelling process. 

The problem of determining conditions for coalescence of 9 has been recently addressed in [23], 
where it has been established that the following tail condition 

OO OO 

k=l n=k 

is sufficient. Notice that this is weaker than the finiteness of the mean of 9, which has the same form 
but without the square. The latter is equivalent to the positive recurrence of the corresponding 
von Schelling process, which is in turn equivalent to the finiteness of the mean of the coalescence 
time of any two of the random walks G Z. In [23] an example of transient von Schelling 

process was also given. 

In the uniqueness regime, if 9 is coalescent a perfect simulation algorithm for the marginal 
distribution p\ in any finite window A C Z can be designed. Indeed, in this case the coalescence 
point Sm,n is finite a.s. for any n,m (z A, and so is the coalescence point 

5a = inf{5m,n, m / n G A}. (19) 

Moreover these coalescence points are all adapted to the filtration 

jrmaxA _ < 72 < max A), s < min A, (20) 

which makes them accessible through sequential simulation. The following proposition essentially 
coincides with a result appearing in [9] in a particular case. 

Theorem 5. Consider a transition kernel p of the form ([3|) with d{A) = 1, a coalescent distribution 
9, and irreducible and aperiodic. Let A be the unique invariant distribution of the stochastic 
matrix P. Then, for any finite A C Z the random vector Xa = (X„,n G A) given by Simulation 
Algorithm 1 is distributed as /ta, the marginal on A of the unique element p of Q{p). 

Simulation Algorithm 1 

1 . Construct the random walks ^ A up to their coalescence time 5a (see Fig. 3); 

2. Sample ^ 5 ^ ~ A; 
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3. Keeping the Km used in the first step and sampling Um i-i.d. uniform in (0,1) for S\ < m < 
max A, compute 

Kfi — FSf^,n{,Kmi Urm m ^ n, n (E A. 



Fig. 3. An example of coalescing random walks, with A = {2fc, k € N+} U {3}, for the simulation of the 
two rightmost adjacent sites. The leftmost site is drawn from the distribution A. 


Proof. By assumption we are working in the uniqueness regime, so we can approximate //a in total 
variation norm with the law of with arbitrarily chosen boundary condition w, as r —)■ —oo. 

Choosing the components of w to be i.i.d. from the law A we can make profit of Lemma [TJ By 
the strong Markov property applied to the random walks T^'^\n G A, conditionally to {S\ = s} 
with s > r, is independent of jrmaxA distribution of A. By consequence we can 

represent the random variable X produced in Step 2 of the algorithm as which implies that 

on the event {5 a = s} with s > r, 

K’" = Fs,n{Km, Um,s<m<n; XJ’-) = X^,n G A. 

By the coupling inequality (see e.g. [22]) we have that the total variation distance between the law 
of X)^'^ and that of the output Xa of the algorithm is bounded by the probability that Sa < r. 
By sending r to —oo the proof is completed. □ 

Notice that if the unique invariant distribution A for the stochastic matrix P is difficult to 
compute, one can use a perfect simulation algorithm for hnite Markov chains to obtain a sample 
from it. 

In the non-coalescent case the simulation algorithm of the previous theorem is unfeasible since 
the coalescence point 5 a of A is not finite with probability 1. Even if the distribution 6 is coalescent 
we may need to stop the simulation when it goes beyond some large negative threshold because 
of memory and time limitations. A fortiori this needs to be done if we are not able to prove 
coalescence. In all these cases it is still possible to produce a sampling algorithm, provided a 
certain small error is accepted. In order to evaluate this error we need to introduce the following 
random time 

5a = inf{5n,m : n.,m G A,5n,m >-oo}. (21) 

In case the set appearing at the r.h.s. of (f2T]l is empty we define 5 a = min A. As a consequence 
5 a is finite, but when 9 is not coalescent, it is not adapted to the hltration (l20]l - 

Here is a e-approximate simulation algorithm, where e is an error which goes to zero as the 
threshold site u G Z appearing in the algorithm decreases to —oo. Indeed, 5 a being finite, in 
principle it is possible to select u sufficiently close to —oo to make the r.h.s. of the forthcoming 
(1231) smaller than any hxed e > 0. In practice, the determination of a tail estimate on the 
distribution function of 5 a can be extremely complicate. 
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Theorem 6. Consider a transition kernel p of the form ([3]) with d{A) = 1 and let irreducible 
and aperiodic. Let A he the unique invariant distribution of the stochastic matrix P and p he the 
unique element of Q{p). Let A C Z he finite, and let u < min A. Let he the law of the random 
vector = (X“,n G A) defined by the following 

Simulation Algorithm 2 

1. Construct the random walk until is reached, for n G A; 

2. For any m G : n G A} sample ~ A, independently; 

3. Keeping the Km used in the step 1 and sampling Um i-i-d. uniform in (0,1) for u < m < 
max A, compute 

Xll = Fu,n{Km,Um,u < m <n;X\„)), n G A. (22) 

Then, in total variation norm 

\\pI-Pa\\<P{Sa<u). (23) 


Proof. Choosing the same boundary condition w as in the previous theorem we can decompose 
the formula ([7j) in the following two steps: first the boundary conditions are propagated up to 


XfiT = Fr,m{Ki, Ui,r <l<m-,wvf X me 


(24) 


for any u < min A. Next the random variables at A are constructed 


= Fu,n{Ku Uuu<l<n-X\X), n G A. 


(n) 


(25) 


Now we construct on the same probability space the random vector ^ produced by step 2 
of the algorithm. It will be shown that, when the event {^a > u] occurs, X^„ a will coincide with 
as given by (1240 . Therefore, on this event also , given by (j25p . will coincide with the 
output of the algorithm X)( given by (l2^ . By applying again the coupling inequality and sending 
r —>■ —oo the proof will be concluded. 

For the last step we define i.i.d. sequences (Kj^\ul^^^),r < I < u, , independently for any 
77 J ^ yu,A^ From each site m, by means of the independent random walks are started. 

As done in the case of two walks, we assume that the rightmost of them is updated first. The 
same {KpUifs defined to construct X'’’’^ are used, except at coalescence sites, where additional 
independent copies are sampled. Analogously to (0), let us define to be the site where the 
random walk starting from m lands under the threshold r. At this site we use the original boundary 
condition appearing in X^’'^. If more random walks land at the same site, additional independent 
copies are sampled as before. 

These boundary conditions are propagated forward in time on each walk, using the 
until the starting points m G are reached. Following the notation of ([7]), we have 


'srr 
-/V >-) 




= Fr 


,(KC,u!”‘>,r<l<m;w%), m £ V" 

Vr 


(26) 
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(m) 


By Lemma [H Xm'^' ' ~ A, and they are independent by construction, so they are identical in law 
to the X^'s generated in step 2 of the algorithm as promised. 

Finally, observe that when Sj\ > u, no site is visited twice by any of these random walks and 
therefore, for any m € V“’^, the random variables Xm'^ defined by (I24p coincide with Xm'^ in 
Using the coupling inequality and sending r to — oo concludes the proof. □ 


As remarked by one of the referees, the previous two algorithms work also in case of non¬ 
uniqueness to construct any stationary element of 0(p)- Which element is picked up depends on 
the invariant distribution of P used in Step 2. In the essentially irreducible case the matrix P is 
itself essentially irreducible, so its unique invariant distribution is the only possible choice. 

Corollary 1. Consider a transition kernel p of the form ([3]) with d{A) = 1 and let P(^.'^ irreducible 
but not aperiodic. Let A be the unique invariant distribution of the stochastic matrix P. Then 
Q{p) has a unique stationary element ps- Let A C Z 5e finite. Simulation Algorithm 1 (for coa- 
lescent distributions 6) constructs a random vector distributed as ps,A- Simulation Algorithm 
2 constructs a random vector converging to ps,A as u ^ —oo. 

Proof. Consider any stationary element fi oi Q (p). By Lemma [T] it has necessarily A-distributed 
single-site marginals. Take the boundary condition w appearing in the proof of Theorem [5] and 
[6] to be //-distributed. Then has the distribution /tA. Since this is either the law or it is 

arbitrarily close in total variation to the random vectors constructed by these algorithms, which 
do not depend on the choice of fi, one establishes both the statements of this corollary. □ 


6 A result with G countable 

In this section G is allowed to be countable. In this case the complete characterization presented 
in Theorem 0] fails, despite the fact that the conditions of essential irreducibility and aperiodicity 
continue to make sense. Indeed, by the lack of compactness of the sets of probability measures 
over existence is not guaranteed. On one side this prevents the construction of more than 
one compatible law when essential irreducibility and/or aperiodicity do not hold, and on the other 
requires to strengthen these assumptions to prove existence and uniqueness. In this section we 
are going to provide an assumption of Doeblin type that, in case G is countable, allows to prove 
existence and uniqueness of compatible laws. For definiteness assume that either G = N+ or 
G = {1,...,|G|}. 

Hypothesis D. For a kernel p of the form ([3]), there exists a certain state, say I without loss 
of generality, and an integer no € N+, with the following property 

Vi € G, 3bj G A* with s(bj) = hq such that inf Phi{i, 1) =: e > 0. (27) 

i€G 

Whereas in the countable case this assumption is strictly stronger than essential irreducibility 
and aperiodicity, if G is finite it is actually equivalent for the following reason. First, the fact 
that for any i G G one has Phiii, I) > 0 for some bj G A* it implies that two different irreducible 
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classes cannot exist. Second, the fact that these words can be chosen with the same depth denies 
the existence of a non-trivial partition in periodic classes as in Proposition [5l 

Before stating the result, define B = {bj,i P = EbgB > 0 and 

0 = i V^bPb. 

P - 
be/? 

These quantities will play in the forthcoming result the same role as played by B, p and Q in the 
proof of Lemma [2J Under Hypothesis D, this matrix has all the entries of the first column not 
smaller than e, so the Skorohod construction gives a coupling function /* : G x [0,1] —> G that, 
in addition to the property that f*{g, U) has the law Q{g, ■) when U is uniform in (0,1), for any 
g € G, satisfies 

f*{9, u) = l,g e G,0 < u < e. (28) 

We warn the reader that this coupling function enters explicitly in Simulation Algorithm 3 
presented below, differently from what happened in the algorithms of the previous section. If the 
cardinality of G is large or infinite the computation of Q, and consequently of the coupling function 
/*, can give rise to accuracy and computational time problems. This is the reason for which, even 
if the following theorem applies to G finite as well, and as such it provides a perfect simulation 
algorithm under the uniqueness regime which is free of error, in practice it may be preferable to 
accept a small error introducing a truncation threshold in the simpler Simulation Algorithm 2, 
which, in addition to the {Ki,Uiys, requires only the computation of the invariant distribution A. 
Moreover Simulation Algorithm 3 does not apply in a situation of non uniqueness, as in the non 
aperiodic case. 

Theorem 7. Under Hypothesis D, there exists a unique element in Q{p), whose restriction to any 
finite A C is the law of the random vector Xa given by Simulation Algorithm 3. 

Simulation Algorithm 3 

1. SetV = A; 

2 . IfV = ^ stop. 

3. Otherwise set m = max V; 

4 - Construct the random walk and check if a subpath corresponding to a word in B appears 
before it lands on or below a site s in V (this is possible only if the distance between m and 
V \ {m} exceeds uq); 

5. If such a subpath appears connecting, say, the sites u and I (with u — I = uq), extract U* 
uniform in ( 0 ,1), independent of all the variables generated previously: ifU* < s, set = 1 , 
compute by forward simulation Xm, using ([ 6 ]) and the coupling functions f*{U*;-) on the 
previously located segments {l,u], together with Xn,n G A for all n such that hits m, 
delete m from V and go to 2.; 
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6. Otherwise replace m by s in V (if s is already in V this means that the two random walks 
have coalesced) and go to 2. 

Proof. The main to prove is that the above algorithm stops in a random but a.s. hnite time. We 
already know that any two walks and with m ^ n, either coalesce, or their distance 

go to infinity a.s. As a consequence it happens with probability zero that the rightmost walk, 
constructed from site m at step 3, only finitely many times is at distance larger than reo from the 
current position of the other ’’surviving” walks, stored in the set V. Each time it is at a distance 
larger than ho there is a probability pe to stop the walk with the determination of the value 
at some site u, independently of the outcome of all previous simulations. Hence a finite number 
of walks will all be stopped in a finite time a.s. We call W the leftmost site where a random walk 
is stopped. 

Next consider any process of the form constructed through the recursion ([7|). If this rule 
is modified by using on each interval {I, ii\ where a path in B has occurred the coupling function /*, 
such process can be represented on the same probability space where the algorithm is constructed, 
without changing its law. Again by the coupling inequality the variation distance between Xa and 
X^'^ will be bounded by the probability that {W < r}. The proof is completed by sending r to 

—CX). □ 
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